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1 Introduction 

The problem of reaching a consensus in a group of autonomous agents has been the 
object of study in a number of situations ranging from linguistics [11, 18, 20] to 
distributed computing [25, 26] and from physics [27] to animal behavior [6, 13, 23, 
24]. 

Research on the latter attempts to explain, by appropriately modeling it, the 
observed behavior of a group of animals, say a flock of birds, whose velocities con- 
verge to a common one. An influential model for this behavior has been postulated 
in [27] by Vicsek and collaborators and studied in [17] where convergence is shown 
under some conditions on the sequence of states of the flock. Also, the work of Buhl 
et al. [5] showed that Vicsek's model accurately describes the convergence to order 
in swarms of locusts. A different model for the same phenomenon was proposed 
in [9] and extended in [10] (to situations other than flocking) and in [22] (to flocking 
situations where a hierarchical leadership structure is present). A main feature of 
these papers is that, in contrast with the results in [17, 27], convergence results are 
established conditioned to the initial state of the flock only. 

The model in [9] postulates the following behavior: every bird adjusts its velocity 
by adding to it a weighted average of the differences of its velocity with those of the 
other birds. That is, at time t, and for bird i, 

k 

Vi{t + h) = Vi{t) + h'^aijivjit) - Vi{t)) (1) 
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where the weights {aij} quantify the way the birds influence each other, and h > is 
the time step. Vicsek's model is more general in the sense that allows the obtained 
Vi{t + h) to be perturbed by some centered noise. More precisely, it replaces (1) by 

k 

Vi{t + h) = Vi{t) + h^aij{vj{t) - Vi{t)) + hHi{t) (2) 
i=i 

where Hi{t) G E is some centered random variable modeling the noise. Here E 
denotes 3-dimensional Euclidean space. 

It is reasonable to assume that the influence weights function of the 

distance between birds. This is the case in [9, 27] and the major difference between 
these models is in the choice of the aij. In this paper we will follow [9] but slightly 
depart from it and take the adjacency matrix to have entries 

K 

- {i + \\x,-xj\\r 

for some fixed K > and a > 0. 

We can write the set of equalities (1) in a more concise form. Let be the 
k X k diagonal matrix whose ith diagonal entry is = X^j<j^. ciij and = — A^. 
Then (cf. [9]) 

v{t + h)- v(t) = -hL^v{t) + hH{t) 

where H = (Hi, . . . , Hj^). 

Note that the matrix notation Lxv{t) does not have the usual meaning of a A; x A; 
matrix acting on M^. Instead, the matrix is acting on E'^ by mapping {vi, ... , Vk) 

to i{Lx)iiVi + . . . + {Lx)ikVk)i<k- 

Adding a natural equation for the change of positions we obtain the discrete 
dynamical system 

x{t + h) = x(t) + hv{t) (D) 
v{t + h) = {Id- hL^)v{t) + hH{t). 

We also consider evolution for continuous time. The corresponding model is 
obtained by taking limits for /i — > and can be given by the system of differential 
equations 

x' = V (C) 
v' = -L^v + H. 

The main result in [9] shows, in the unperturbed case, for both discrete and 
continuous time, convergence to the alignment of the velocities. More precisely, 
convergence to a common velocity when the initial positions and velocities of the 
flock are not too dissimilar (for a > 1, otherwise, convergence holds uncondition- 
ally). For systems (D) and (C), due to the presence of noise, we can not expect 
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(3) 



convergence to a common velocity. Once the velocities {vi, . . . ,Vk} are similar 
enough compared with the noise the latter will, with positive probability, outdo the 
contractive character of the system. Perfect alignment as in [9] should therefore 
be replaced by "nearly-alignment." A formal measure of similarity (and with it a 
definition of nearly-alignment) will be given soon in Section 2. A description of the 
forms of noise we consider in this paper will be given in §3.1 and §4.2. We never- 
theless state now an informal version of our main results (see Theorems 1 and 3 for 
precise statements). 

Main result Assume that at time the positions and velocities of the Hock are 
not both too dissimilar (for a > 1, otherwise, no assumption is needed) and that 
the time step h is small enough (in case of discrete time). Then nearly-alignment is 
(quickly) reached with a certain probability and we exhibit a lower bound for this 
probability in terms of the initial similarity of positions and velocities, the variance 
of H, and the parameters k, K and a. 

The proof of our main result closely follows the proofs in [9] . Some changes had 
to be made to make room for the noise and in doing so we did a few simplifications 
as well. 

2 Some preliminaries 

2.1 Laplacians 

Given a nonnegative, symmetric, kxk matrix A the Laplacian L of ^ is defined to 
be 

L = D-A 

where D = diag((ii, . . . , (i/, ) and = aej- Some features of L are immediate. 

It is symmetric and it docs not depend on the diagonal entries of A. 

The matrix Lx in (D) and (C) is thus the Laplacian of Ax- It satisfies that for 
all u eE, Lx{u, . . . ,u) = 0. In addition, it is positive semidefinite. 

The smallest eigenvalue of Lx is zero. Its second eigenvalue is called the Fiedler 
number of Ax- We will denote it by (px- 

Proposition 1 ([10, Proposition 1]) Let A be a k x k nonnegative, symmetric 
matrix, L = D — A its Laplacian, (p its Fiedler number, and fj, = miuj^j aij . Then 
(i)>kn. □ 

2.2 Similarity and nearly-alignment 

The inner product on E naturally induces an inner product on E'^. Let A be the 
diagonal of E^, i.e., 

A = {{u,u, ... ,u) \ueE} 
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and A"*- be the orthogonal complement of A in E'^. Then, every point u G E*^ 
decomposes in a unique way as v = va + vj_ with va € A and v± G A-*-. This 
decomposition has a simple explicit form. Denote by 

1 ^ 

i=l 

the mean of the Vi. Then va = {'m, . . . , m) and v± = {yi — m, . . . ,Vk — m). This 
follows immediately from the equality 

k / *^ \ 

i=l \ i=l I 

1, — km^ = (m, 0) = 0. 



We can look at the evolution of the velocities Vi{t) decomposing into the evolution of 
their mean m{t) and that of the distances to that mean v±^ = (vi — m, . . . ,Vk — m) 

and a key observation at this stage is the fact that convergence to a common velocity 
(or nearly-alignment) is a feature of the second evolution only. More precisely, 
the condition "the velocities Vi{t) tend to alignment is equivalent to the condition 
^^v±{t) — > 0". We arc thus interested on the projection {x±{t),v±{t)) over A"*- x A"*- 
of the solutions {x(t),v{t)) of the system (D) (or (C)). It is easy to show (sec [9]) that 
these projections are the solutions of the restriction of (D) (resp. (C)) to A"*- x A"*-. 
More precisely, they are the solutions of 

x{t + h)± = x(t)± + hv{t)± 

v{t + h)± = (Id- hL^^)v{t)± + hH±. 

Hence, in what follows, we will consider positions in 

X := E'^/A ~ A^ 

and velocities in 

V := E'^/A ~ A-^. 

For x,v we will denote x = x± and v = v±. Finally, we will denote H = H±. 

It is natural now to take the norm \\x±\\ of the projection x± as the dissimilarity 
of X and similarly for Hi'.lII. In the case of x we may call this measure the dispersion 
of the flock. It relates with its "diameter." 

Lemma 1 For all x G E'^, maxj^j ||a;i — a;j|| < \/2||a;j_||. 
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Proof. Write x = x/^ + x±_ = (u, . . . ,u) + ((a;j_)i, . . . , (£Cj_)fe). Then, for all 
i ^ j, Xi - Xj = {x±)i - {x±)j and 

\\xi - xjIIe = \\{x±)i - (a3±)j||E < ll(a;±)i||E + ||(a;±)j||E < V2||«±||e'=- ^ 
The notion of similarity leads to the following definition. 

Definition 1 Let u > 0. We say that the flock {1,... ,k} is u -nearly- aligned (or 
simply nearly-aligned) when \\v±\\ < v. 



2.3 A few functions of the initial state 



The initial state of the flock is characterized by the pair {x{0),v{0)). For conver- 
gence to alignment (or to nearly-alignment) to hold one needs to require that the 
dissimilarities of these two vectors are not both large. 

We close this section with a few quantities related to these initial dissimilarities 
which will occur when describing the conditions ensuring convergence. These are 



a 



2V2 



vim, 6=1 + V2||x(0)||, 



max 



b 

l-a 



[{2a)^,2h] 



a— 1 
- 1 



V2 



and 



if a < 1 
if a = 1 
if a > 1, 

Tja 



3 Discrete time 

Assume the initial state for (D) is at time 0. Then the sequence of states is 
{x{th)^v{tK)}ten- To simplify notation we will denote x{th) simply by x[t\ and 
similarly for v. 



3.1 Statement of the result 

Recall, the random noise in (D) has the form // = (Hi, .... Hi^) and we have 
H = Ha + H. Note that the component of H corresponds to the perturbation 
of the common velocity va within A and is therefore of no consequence regarding 
convergence to alignment or nearly-alignment. 

In what follows we assume that, for alH G {1, . . . , A;}, and for all t G N, 

H,{t)={e^\t),ef{t),ef{t)) 
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where the ef^ (t) are one dimensional random variables, the coordinates of the per- 
turbation. 

We consider two possible laws for the distribution of H: 

Uniform: H^U3k{0,r) 

where, for some r > 0, J73fc(0, r) is the uniform distribution in B{0,r) C M^^, and 

Gaussian : H ~ N{0, a^Idsfe), 

a 3A;-dimensional centered Gaussian distribution with covariance matrix a^ld^^. As 
a consequence, in the Gaussian case, the random variables are independent. 
Our main result for discrete time is the following. 

Theorem 1 Consider the system (D) with adjacency matrix given by (3) . Assume 
that h satisRes 



h < min 



kK 



l/a' 



2{k-l)VkK' 2V2\\v{0)\\ \2no 
Assume also that one of the three following hypothesis holds: 

(i) a <1, 

(ii) a = l,and ||^;(0)|| < 

(iii) a > 1, and 

I \ a — 1 

— > 6 + 2kKha. 

aa J a 

Then u-nearly-alignment for some v < ||f(0)|| occurs in a number of iterations 
bounded by 

r .- 2C/o",/ 11^0)11 ■ 

with probability at least 

in the uniform case (1 if r < Hou) , and with probability at least 



r(^) 



in the Gaussian case. 
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Remark 1 For each of the cases (i), (ii), and (iii) we can replace Uq and Ho by 
their respective values. In case (iii) and with uniform noise, for instance, this yields 

and 

Hoi^ _ 2-'^-^kK / a-1 

~ r U(l + V2||x(0)||) 
Note that this means that for 

we have 

Prob{nearly-align in at most Tq iterations} > 

From these expressions it is easy to read the role of the deterministic setting pa- 
rameters h, k and the probabilistic r, the radius and the initial dissimilarities 
||a;(0)||, ||i'(0)|| both in the time required to reach nearly-alignment and in the con- 
fidence with which this occurs. 




f kK \ 

Uo(iiz(o)r) j 



Remark 2 The integral in the bound for the probability in the Gaussian case sat- 
isfies, when k is odd and writing n = the equality 

rx j.n-1 / ™n-l „ \ 

For X = ^y'Hov/{2a) and for small a this probability bound is equivalent to 

l--^e--x"-\ 

r(n) 

By L'Hopital's rule, this equivalence holds as well when k is even. 



3.2 Bounded noise 

Fix a solution (x, of (D). At a time t G N, x[t] and v[t] are elements in X 

and V, respectively. In particular, x[t] determines an adjacency matrix A^^^. For 
notational simplicity we will denote its Laplacian and Fiedler number by Lt and (pt, 
respectively. 

Lemma 2 For all x & X, 

\\L^\\ < 2(A; - l)VkK. 
In particular, if h < ^^^^ '^ll-^a;!! ^ (0' -*-]• 
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Proof. For all i,j < k, aij < K. Therefore, 

k 

i<k 



-^xllmax = max | < 2{k - l)K. 



Now use that < \/fc||i^a;||max [14, Table 6.2] to deduce the result. □ 

Proposition 2 Assume that h < — — \ r-^^ . Assume also that, for all < t < T, 
\\H\\ < T^oll^^MII- Then, for all t < T, 

\\v[t + l]\\ < {1 - h(l>t + hHo)\\v[t]\\. 

In particular, \\v\\ is decreasing as a function of t for t < T. 

Proof. The linear map Id — hL^ is self-adjoint and its eigenvalues are in the 
interval (0, 1). Its largest eigenvalue is 1 — h(f)t. Therefore 

||u[i + l]|| = \\{Id- hit) v[t] + hH\\ <\\ld- hLt\\\\v[t]\\ + h\\H\\ 

< {1 - h(^t)\\v[t]\\ + hno\\v[t]\\ = {1 - hcpt + hno)\\v[t]\\. □ 

Corollary 1 In the hypothesis of Proposition 2, for all t & {0, . . . ,T — 1} we have 

t-i 

\\v[t\\\<\\v[{)\\\\[{l-h<p^ + hHo). □ 

1=0 

A proof of the following lemma is in [8, Lemma 7]. 
Lemma 3 Let c\,C2 > and s > g > 0. Tiien the equation 

F{z) =Z' - Cizl - C2 = 

has a unique positive zero . In addition 

< max|(2ci)^, (2c2)^| 
andF{z)<OforO<z<z*. □ 

Theorem 2 Let T G NU {+oo}. Assume that, for allO<t<T, \\H\\ < 'Ho\\v[t]\\, 
and that h satisfies 



h < min 



1 I f kK 



2{k-l)VkK' 2V2\\v[Q]\\ \2no 



l/a' 



where Ho is as in Theorem 1. Assume also that one of the three following hypothesis 
holds: 



(i) a < 1, 

(ii) a = l, and\\v[Q\\\ < 

(iii) a > 1, and 



— > 6 + 2kKha. 

aa J a 



Then 1 - G (0, 1), for allO<t<T, \\x[t] \\ < Bq and 

kKV 



Ht]||<Ho]|| 1-/. 



2U^ 



In particular, when T = oo, \\v[t\\\ ^ for t — > oo. 
Proof. Let 

T = |i G {0, . . . ,r - 1} I (i + V2\\x{t)\\y < ^} . 

Note that in all three cases ((i), (ii), and (iii)) the definition of TCq implies that 
G T and hence, that T 7^ 0. Assume that T 7^ {0, ...,T — 1} and let t = 
min{{0,...,r-l}\T}. 

For t < T, let t* be the point maximizing ||x|| in {0, 1, . . . , t}. Then, by Propo- 
sition 1 and Lemma 1, for i e {0,1, . . . , t}, 

kK kK 

d)i > ^ . > 



{1 + V2\\x[{\\\r - {i+V2\\xnr 

Moreover, since t* <t<twe have 

J, _ ^ > > =• R(t*) 

°- (1 + V2||x[i*]||)" 2(1 + V2||a;[i*]||)" ' ^' 
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Using Corollary 1 we obtain, for all t <t, 

T-l 



T-l 



IxrII < 



< 



< 



< 



< 



x[0] 
x[0] 

x[0] 

x[0] 

x[0] 
x[0] 



+ J2\\x[j + l]-xm<HO]\\+hJ2H. 

j=0 j=0 

+h(^m\\+j2M 

+ h ( iit;[o]ii + ii^[o] II n (1 - ^-^^ + ^^o) 

+h\\vmY,{^-hR{ny 

j=0 

+ 



hR{t*)" 

2(i + v^l|x[r]||r 

kK 



v[0]\\. 



Multiplying by a/2 and taking t = t* , the inequality above takes the following 
equivalent form 

V2imii<V2woiii + ?^!(i±iM:ii„ 



or yet 



,i+V2wnii)<(i+v/2iwoiii)+?^Si±^fffiii. 

Let z = 1 + V2||x[r]||. Then (4) can be rewritten as F{z) < with 

F{z) =z-az'^-b. 



(4) 



(i) Assume a < I. By Lemma 3, F{z) < implies that (1 + V2||a;[t*]||) < Uq. 
Since Uq is independent of t we deduce that, for all t <t, 



V2 



Therefore, for all t < 



(1 + V2\\x[tW < (1 + ^lk[**]lir < < 



kK 
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the last by the definition of Ho- It follows that 

\m\ = Mt- i]ii + h\\v[t- i]ii < \\x[t- i]ii + himw 

and therefore 

il + V2\\x\t\\\) < 1 + V2\\x\t-1]\\ + V2h\\v[0]\\ 

the last by our hypothesis on h. This is in contradiction with the definition of t and 
shows that no such t exists. That is, for alH < T, ||a;[t]|| < Bq and 

. n, T. kK kK 

By Corollary 1, for i < T, 

t-i 



II^^WII < \\vm\llil-h4>i + hHo) < {1 - hFof \\v[0]\\. 

The convergence results for the case T = oo now readily follow (cf. [9, Theorem 3]). 
(ii) Assume now a = 1. Then (4) takes the form 

(1 + V2\\x[f]\\) (^1 - If lb[0]||^ - (l + V2\\xm) < 0. 

By hypothesis, \\v[0]\\ < This implies that 

Hf]\\<kK -1^^, 

We conclude that, for all t <t, 

by the definition of Ho- We now proceed as in case (i). 
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(iii) Assume finally a > 1. The derivative F'{z) = 1 — aaz"" ^ has a unique zero 

1 1 a 1 

1 \ CK — 1 / X \ CK — 1 / 1 \ / 1 \ 



a 



a 



a 



1 1 
1 \ / 1 \ a — 1 



a 



a 



-6 



a 



> 



the last by our hypothesis. Since F{0) = -b<0, F"{z) = a{a - l)az°'-'^ > for all 
z > 0, and F{z) — — oo when z — oo, we deduce that the shape of F is as follows: 



(0,-6) 




Figure 1 

For i e N let z{t) = 1 + V2||x[r] ||. When t = we have i* = as well and 



z{0) < 1 + V2|| 



This implies that z{0) < ze. Assume that there exists t <T such that z{t) > Zu and 
let r be the first such t. Then r = r* > 1 and, for alH < r 

1 + V2\\x[t]\\ < z{r - 1) < Zi. 

Let zq be the intersection of the z axis with the line segment joining (0, —b) and 
(2;*,F(z*)) (see Figure 1). The line where this segment lies has equation 



' + b = z 
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from which it follows that 
It follows that, for alH < r, 

lkWII<^(^^-i)<^(^o-i) = i?o. 

In particular, 

Hr-l]||<i=(z,-l). 

For r instead, we have 

\\x[r]\\>^{z^-l). 

This implies 

\\x[r] - x[r - 1]|| > ||x[r]|| - \\x[r - 1]|| > -^(zu - zg) > -^(z* - ze). (5) 

From the intermediate value theorem, there is ^ G [z£,z^:] such that -^(z*) = 
F'{^){z^ - ze). But F'{^) > and F'(^) = 1 - aa^"-^ < 1. Therefore, 

z* — Zi> F{z^) 

and it follows from (5) that 

\\x[r]-x[r-l]\\>^F{z,). (6) 

But 

\\x[r] - x[r - 1]|| = h\\v[r - 1]|| < /i||v[0]|| 

the last since \\v\\ is decreasing for t <T. Putting this inequality together with (6) 
shows that 

F{z^) < V2h\\v[0]\\ 

or equivalently, 

1 1 

-V" (-Y" ^-b<V2h\\vm\ 

a J \a J a 

which contradicts our hypothesis. This shows that, for all t < T, z{t) < Z( and 
hence, for all t <t, 

(1 + V2\\x[t]\\r < zs = ((1 + V2\\xm)^y < 2-" 



2no 

the last by the definition of Ho- We now proceed as in case (i). □ 
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3.3 Proof of Theorem 1 

Proposition 3 For e > 0, let p{e) = Prob{||i?|| < e}. Then, in the uniform case, 
we have the bound 

, 3k 

(pie) = lifr<e) while in the Gaussian case \\H/a\\'^ has a Chi-square distribution 
with 3A; — 3 degrees of freedom, and in consequence 



P(-)>(-) > 



P{e) = [ 
Jo 



3fe-5 



t— 

e'*dt. 



'3fc-3^ 



2 / 

Proof. In the uniform case, since \\H\\ < \\H\\, we have 

/ f \ 3fc 

Prob(||i/|| < £) > Prob(||l/|| < e) 



In the Gaussian case, the decomposition H = H ^ + H ±^ takes the form = 
(m, . . . , m), with m = ^ ^j=i -^-L = H — (m, . . . , m). Consequently 

\\Hf = \\H^f = ^(if, - mf = j:j:{ef - m^^^f, 
j=i 1=1 j=i 

where m = (m(^),m(^\m(^)). A standard result in statistics (see [2], page 219) 
states that Yl'j=ii^-f^ ~ fn^^^)"^ has a Chi-square distribution with k—\ degrees of 
freedom. Therefore, by independence, ||if/a"|p has a Chi-square distribution with 
3A; — 3 degrees of freedom. The expression for p{e) follows from the form of the 
density of the Chi-square random variable. □ 

We can now give the proof of Theorem 1. 

Let r > and assume that the hypothesis of Theorem 2 holds for T. Then, 



Hi]||<Ho]|| 1-/. 



kK ^ * 



2(1 + V25o)" 
where Bq is as in Theorem 2. Therefore 

kK ^ ^ 



|v[r]||<z/ <S= ||i;[0]|| l-/i j= <v 

I L Jll - II L Jll \ 2(1 + V25o)°/ - 



V V 2(1 + V2Bo 

hkK ^{'■^) = n. 
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At each iteration t, Theorem 2 requires that < HoUfii]!! for a quantity Ho 

which depends on the initial conditions and on the case ((i), (ii), or (iii)) at hand. 
If nearly-ahgnment has not occurred, then \\v[t\\\ > v and therefore, 

Proh{\\H{t)\\ < Ho\\v[t]\\} > Proh{\\H{t)\\ < nQv}>p{nQu). 

It follows that 

Prob{\\H{t)\\<n4v[t\\\ for t = 0,...,To-l} >p(7^oi^f° 
and therefore the claimed bounds in the uniform and Gaussian cases. □ 

4 Continuous time 

The goal of this section is to show a continuous time version of Theorem 1. In 
contrast with the discrete time setting, though, the description of the noise H is 
not straightforward. There is no obvious continuous time version of a sequence 
of independent and identically distributed random variables. We thus begin by 
discussing our model of noise. 

4.1 Continuous time stochastic processes 

A continous time stochastic process {X(t) | i > 0}, or for short a stochastic pro- 
cess, is a family of random variables X{t) defined in a common probability space 
($7, Prob). More precisely, the stochastic process depends on two arguments, t 
and uj, and when necessary we denote it by X{t,io). In order to describe our as- 
sumptions, we make a brief comparison with the discrete time situation. 

In the discrete time situation it is natural to assume that each perturbation 
X{t) is a centered random variable (i.e., E(X(t)) = 0), that for all values of t 
the random variables X{t) are mutually independent, and that they have a common 
distribution. Making some additional assumption on the distribution of each random 
variable, typically assuming normality with a given standard deviation a, completes 
the specification of the noise probabilistic structure. Let us call this structure a 
Gaussian white noise sequence. 

Unfortunately, in continuous time we do not have a reasonable tractable model 
that shares the properties of the Gaussian white noise sequence. To understand 
why this is so we need to consider the properties of the trajectories of the processes, 
i.e., the curves obtained when G is fixed and t ranges in the interval [0, oo). 
Assuming independence of the random variables of the stochastic process for close 
values of t makes the trajectories of the process to have an extremely irregular 
behaviour. Consequently, if we want the trajectories of the process to be continuous, 
or differentiable, we can not assume the independence of the random variables for 
pairs of close values of t. 
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To analyze this difficulty consider first, in the discrete time case, the accumulated 
perturbation produced by a Gaussian white noise sequence {X{t) \ t G N}. That is, 
consider the sequence of sums 

5(0) = 0, S{t) = X{l) + ---+X{t), t = l,2,.... 

This random sequence associated to the Gaussian white noise sequence, called Gaus- 
sian random walk, does have a natural counterpart in the continuous time case, 
known as Wiener process or Brownian motion. A Wiener process W = {W{t) \ 
t > 0} is a continuous time stochastic process satisfying the following properties, 
that are natural extensions to continuous time of the properties of the sequence 
{S{t) \teN}: 

(a) W{0) = 0. 

(b) The increments of the process are independent random variables. That is, for 
all < io < *i < ■ ■ • < the random variables 

W{h) - W{to), W{tn) - W{tn-l), 

are independent. 

(c) The increments are homogeneous and Gaussian. That is, for all t,h > 0, the 

random variable 

W{t + h)- W{t) 
has a centered Gaussian distribution with variance h. 

(d) Finally, the trajectories of W are continuous. That is, the curves {W{-,(jj)} 
obtained when a; G O is fixed and t>0 are continuous functions of t for almost 
all u efl. 

In the discrete time case the Gaussian white noise sequence can be recovered from 
the sums S{t) by taking differences X{t) = S{t) — S{t — 1). In the continuous time 
case we would like to have a formula like X{t) = W{t), but it is not immediate 
to give a sense to this last time derivative, as it is known that the trajectories of 
the Wiener process are nowhere differentiable [4]. A possible way out is to first 
take a differentiable approximation of the Wiener process and then take the time 
derivative of this approximation as a model of noise in the continuous time case. 
Another, alternative, way out is described in Remark 5 below. 

The approximation is obtained by convolution with a smooth kernel. Let i/; : M — > 
M+ be a function, with compact support, say supp('0) C [—1/2,1/2] and such 
that J^ip{x)dx = 1. For 6 > consider 



16 



that has supp(V'5) C [-5/2,5/2]. 

The approximation = {W^{t) | t > 0} of the Wiener process is obtained by 
convolution with -i/j^ in the fohowing way: 

W\t) = {i^s * W)it) = [ i>sit - s)Wis)ds = [ '^si-w)W{t + w)dw, (7) 

where, if s < 0, we replace W{s) in the integrand by W{—s), where W is another 
Wiener process independent of W. 

Observe that the process inherits the regularity properties of ^. In particu- 
lar, it has trajectories. We now define a noise process as the time derivative 

X^^t) = ^W\t) = l^§-^{Mt - s))W{s)ds (8) 

= \ [ ip{-w) {W{t + 5w) - W{t - 5/2)) dw (9) 

(> Jr 

- s)dW{s). (10) 



Here the second equality in (8) is obtained by differentiation under the integral sign, 
(9) after a change of variables s — t = 5w and using that 'tp{—w)dw = 0, and (10) 
by integration by parts departing from (8) (note, this involves a stochastic integral 
[21]). 

Using (10) and Ito's isometry for the stochastic integral [21, Theorem 4.2] we 
obtain (we write X instead of X^), for /i > 0, 

B{X{t + h)X{t)) = B( [ Mt + h-s)dW{s) I il^sit - s)dW{s)] 

\Jr Jr J 

= / ips{t + h - s)ipsit - s)ds. (11) 
Jr 

Taking ^ = 0, we obtain 

E{X{t)^)= [ ^s{t-s)Mt-s)ds=\\^Psf = h^l;f. (12) 
Jr 

Taking h>5m. (11) and using that supp(V'5) C [—5/2, 5/2] we obtain 

E (X(t + /i)X(i)) = 0. (13) 

Furthermore, from the different expressions in (8—10) we obtain the following prop- 
erties of the process {X{t) | t > 0}: 

(a) It is a centered Gaussian process with variance = (V<^)IIV'lP- That is, 

X{t) - N{Q,{l/5)\\i^f) for ah t G [0,r]. 
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(b) It is a stationary process. That is, for all times ti, . . . ,tn, intervals /i, . . . , /„, 

and time increment h, we have 

Prob(X(ti) eh,.. .,x{tn) e In) = Prob(X(ti + h)eli,.. .,X{tn + h)e In). 

(c) It is S-dependent. That is, the two sets of random variables 

{X{s) \s<t} and {X{s) \s>t + 6} 
are independent for each t. 

The first property can be obtained from (8) since the integral there is the limit of 
a linear combination of Gaussian random variables. Indeed, such a linear combi- 
nation remains Gaussian and the limit of the resulting random variables preserves 
Gaussianity as well. This variable is centered since all the involved variables are 
centered, and the limit defining the integral preserves the expectation. The value 
for the variance follows from (12). 

The stationarity property of X(t) is inherited from the stationarity of the incre- 
ments of the Wiener process. In order to see it we use the representation (9). The 
probability distribution of the stochastic process + w) — W{t — 5/2) \ —S/2 < 
w < 6/2} does not depend on the value of t, or more precisely, the probability 
distribution of {W{ti + w) - W{ti - 5/2) \ -5/2 <w < 5/2, i = 1, . . . , n} coincides 
with the probability distribution of {W{ti + h + w)- W{ti + h- 5/2) \ -5/2 <w< 
5/2, 1 = 1,..., n}. This makes the process probabilities invariant under a shift of h, 
i.e. the process satisfy the definition of stationarity in (b) above. 

Finally, the (5-dependency is a consequence of (13). This equality give us non- 
correlation, when the lag h> 5. The independence follows since a Gaussian vector 
without correlation has independent components. It should be noticed that this 
property is not essential to our developcmcnt below; it simply mimics the discrete 
time independence. Furthermore, it is possible to derive similar results in this 
discrete time case for a weakly dependent noise. 

Remark 3 The contents of this section is not new. It is exposed in certain detail 
for ease of the reader. Regarding the equalities in (8-10), it can be seen that any 
centered Gaussian process admits such a representation with an adequate kernel. 
General results on Gaussian processes and their diverse applications can be found 
for instance in [1, 3, 7]. 

4.2 Statement of the main result 

We can now describe the noise H in (C) and state a continuous version of Theorem 1. 

We assume that Hi{t) is a three dimensional Gaussian centered, stationary 
stochastic process, that satisfies a (5-dependence condition for some 5 > 0, has 
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trajectories, and independent coordinates. More precisely, we assume that Hi{t) = 
(^ef^^ (i)) ^i^^ ) where each coordinate is given by 

ef (t) = aVd [ Mt - s)dWP{s) (14) 
Jr 

where ips is a kernel as in §4.1, CT > 0, and {wP{t) \ t>0} is a set of 3k independent 
Wiener processes. That is, each coordinate of is of the form a\f6X^ with as 
in §4.1. Note that the variance Var {ef\t)^ = for ah t > 0. 

Theorem 3 Consider the system (C) with adjacency matrix given by (3) and noise 
given by (14). Let xo,vo G E. Then, there exists a unique solution {x{t),v{t)) of 
(C), defined for all t G M, with initial conditions x{0) = Xo and v{0) = vq. Assume 
that one of the three following hypothesis holds: 

(i) a<l, 

(ii) a = l, and |K0)|| < 

(iii) a > 1, and 

1 \ ^ a - 1 

— > b. 

aa J a 

Then i^-nearly-alignment for some v < ||f (0)|| occurs before time 

kK \ u 
with probability at least 

\ \a\/3kj 6^/2^ \aV3kJ J 

where *(y) = e ^du is the standard normal distribution and (p = ^' its 

density. 

Remark 4 Using the identity 



1 _ *(a;) = ^ ( 1 + 



and performing some elementary computations we obtain that the bound in (15) is 
equivalent, for small a, to 
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Remark 5 An alternative way to model the noise in our system relies on the simi- 
larity of the Gaussian random walk and the Wiener process. Integrating the second 
equation in (C) we obtain 



v{t) = v{0) 



/ Lxv(s)ds + / H{s)ds. 
Jo Jo 



The last term in the right-hand side is the accumulated noise for which we noted in 
§4.1 that the natural continous time version is the Wiener process. Multiplying the 
latter by cr > (as we did at the beginning of §4.2 to obtain \/ar{H) = aHd^k) we 
obtain ^ 

v{t) = v{0) - / L^v{s)ds + aW{t) 
Jo 

and integral equation often written in its "differential form" 

dv{t) = -L^v{t)dt + adW{t). 

Hence, an alternative to (C) is the system of stochastic differential equations 

dx = vdt (SDE) 
dv = —Lxvdt + adW. 

The construction of a solution for this system relies on the stochastic calculus de- 
veloped by Ito [16]. 

We note that, while it is possible to prove that —>■ W when (5 — > 0, it is 
not generally true (cf. [19]) that the solution of a system of stochastic differential 
equations driven by a smoothed noise converges towards the solution of the 

corresponding system driven by the original noise W. Investigating whether this is 
the case for (C) and (SDE) would take us out of the scope of the present work. 

To prove Theorem 3 we follow the steps in the proof of Theorem 1. 



4.3 Bounded noise 

For X G X we denote r(a;) = ||a;||^ and ior v £ V we denote A{v) = ||f ||^. 

In this section we fix T G (0, oo) U {+00} and a solution {x, v) of (C) (which 

we assume exists and is, almost surely, differentiable in [0, T)). The meaning of 

expressions like (f)t, Lt, A(t), or r{t) is as described in §3.2. 

Denote = (pr- 
re[o,t] 

Proposition 4 Assume that, for all < t < T, \\Ht\\ < \\v{t)\\no. Then, for all 
0<t<T, 

\\v{t)\\ < ||?;(0)||e-*(**-^o). 
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Proof. Let r G [0,t\. Then 



= 2{v'ir),vir)) 

= -2{LrV{T) + Hr,v{T)) 

= -2{LMr),v{T)) -2{H^,v{t)) 

< -2,/.,(,)A(T) + 2||i/,||||t;(r)|| 

< -2A(t)(</>,(,)-Wo)- 

Here we used that is symmetric positive semidefinite on V. Using this inequahty, 

t rt Ml 



ln(A(r)) 



= firr\dT< I -2{(t>r-Ho)dT<-2t{^t-no) 

Jo HV Jo 



I.e., 

ln(A(t)) - In(Ao) < -2t{^t - Ho) 
from which the statement follows. 



□ 



Proposition 5 Assume that $t > Ho for aUO<t<T. Then, for aUO<t<T, 



\x{t)\\<\\x{0)\\ + 



Proof. For r < t we have |r'(r)| = |2(w(r), x(r))| < 2||?;(r)||||x(T)||. But 
||a;(T)|| = r(r)V2 and ||v(r)f = A(r) < Aoe-^-^^^^-^o), by Proposition 4. There- 
fore, 

r'(r) < |r'(r)| < 2 (Aoe-'^(*--^o)) r(r) ^2 (16) 
and, using that r i— >^ $r is non-increasing and that — 'Ho > for all t <t, 



Jo r> 



r(r)V2" 



dr < 2 /7Aoe-2-(*-^o)y^'dr 



'0 



< 2 / Al^'e-<^^-^°^dT 
Jo 



2A 



1/2 



,-T(*t-Wo) 



the last inequality because $t > TYq. This implies 



< 



2Ay^ 



r(r)V2 



1 r*IM,,,<^ 



r(r)V2 
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from which it follows that 

The main result in this section is the following. 

Theorem 4 Assume that, for allO <t <T, ||-H"(i)|| < ||'u(t)||'^o- Assume also that 
one of the three following hypothesis hold: 



(i) a < 1, 

(ii) a = l, and||^;(0)||<^, 

(iii) a > 1, and 



a— 



1 

1 \ a — 1 

> b. 



a/ a 

Then, for allO<t<T, \\x{t)\\ < Bq and 

\Ht)\\ < \\v{0)\\e ^0 

In particular, when T = oo, \\v{t)\\ for t — > oo and there exists x & X such that 
x{t) — > X when t — oo. 



Proof. Let 



T = !^tG[Q,T)\{l + V2\\x{t)\\r<^Y 



Note that in all three cases ((i), (ii), and (iii)) the definition of Ti.Q implies that 
G T and hence, that T / 0. Assume that T / [0,r) and let t = inf{[0,r) \ T}. 
Clearly, 1 + V2||x(f)|| = ^. 

By Proposition 1 and Lemma 1, for all x e X, 

kK kK 

> ^ r. ^ > 



{1 + m&Xi^j \\xi - XjW)^ - (1 + v^||x||)"' 

Let t <t and t* G [0,t] be the point maximizing in [0,t]. Then 

^ < . kK kK 
$t = mm 07- > mm — > 



rem re[o,t] (1 + V2||x(t)||)" {1 + V2\\x{t*)\\)"' 
Moreover, since t* < t < t, t* e T and we have 

<^t-no> ^ no> ^ > 0. (17) 

(1 + V2||x(t*)||)" 2(1 + V2||x(t*)||)" ^ ' 
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Hence, we may apply Proposition 5 to obtain 

1 



\xm < ||x(0)|| + ||x;(0)|| 



< H0)|| + ^H0)ll(i + V2|miir_ ^^^^ 

kK 

Since t* maximizes F in [0,t] it also does so in [0,t*]. Thus, for t = t*, (18) takes 
the form 

(l + V2\\x{n\\) - 2v^||.(0)||^i±^^||^^ - (l + V2||x(0)||) < 0. (19) 

Let z = 1 + \/2\\x{t*)\\. Then (19) can be rewritten as F{z) < with F{z) = 
z — az^ — b. One can now finish the proof by dividing in cases as in Theorem 4 and 
following the steps in its proof. □ 



4.4 Proof of Theorem 3 

We begin with a 
described in S4.2. 



We begin with a result on the behaviour of the maximum of the processes e- 



Proposition 6 Fix T > 0. Denote e{t) = ef\t) and 

p(x) = Prob I max |e(t)| < x] . 

\0<t<T J 

Then 

l-p(x) = Probf max \e{t)\ > x) <2 I^Ml^ (-] + 1 - ^ (-) 
\o<t<T' - y - 1^ ^ y^j y^j 

Proof. The proof is an application of Davies's inequality [12] (see also Chapter 4 
in [3]): 

Prob ( m^^e(0 > .) < -^^{^) £ V^)dt + 1 - *(^)- 
where rii{t,t) = (e(i)^) and therefore, using (11), 



rn{t,t) - ^2 
The conclusion now follows from the trivial bound 



Prob I max |e(t)| > x ) < 2 Prob ( max e(t) > x] . □ 

\0<t<T J \0<t<T ) 
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We can now give the proof of Theorem 3. The existence of a unique solution 
follows, for each uj e ^, from [15, Chapter 8]. 

Using that < \/3^||-ff(0lloo) and Proposition 6 for one coordinate we 

obtain 

Prob ( max \\H(t)\\<e] > Prob f max \\H(t)\\<£] 

\0<t<T J \0<t<T J 

> Prob I max max max |e,-^^ (t) \<^= ) 

Vo<t<Tl<j<fcl<^<3' ' ^/SkJ 



Prob ( max |e (t) |<^= ) 



3fc 



> O - 2 



Ta\m f e \ . / e 



6V2^ \aV3kJ ' ^ ^ \aV3k 

>, 3k 



3k 



I \a^MJ SV2^ \aV^J j 

Similarly as in the proof of Theorem 1, but taking into account that now v{t) is a 
continuous function we define 

T{u;) = M{t>0\\\v{t)\\<,^}, ro = ^ln'^"''^°^ll 
Taking e = i/Ho we now see that T{uj) < Tq with probabiUty at least 

\aV3kJ SV2^ "^KaVSkJ J 

Let us then take to in the set maxo<t<ro || < i^'Ho. By our previous compu- 

tation, this set has a probability not smaller than the bound (20). If T{aj) > Tq, 
then ||f(t)|| > on the interval [0,To], Theorem 4 holds for T = Tq, and we obtain 
\\v{t)\\ < u for some t < Tq, obtaining a contradiction. This concludes the proof. 

□ 
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